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ABSTRACT 

We present the initial results of our investigation of the star-forming complex W49, one of the youngest and 
most luminous massive star-forming regions in our Galaxy. We used Spifeer/Infrared Array Camera (IRAC) 
data to investigate massive star formation with the primary objective of locating a representative set of protostars 
and the clusters of young stars that are forming around them. We present our source catalog with the mosaics 
from the IRAC data. In this study we used a combination of IRAC, MIPS, Two Micron All Sky Survey, and 
UKIRT Deep Infrared Sky Survey (UKIDSS) data to identify and classify the young stellar objects (YSOs). We 
identified 232 Class O/I YSOs, 907 Class II YSOs, and 74 transition disk candidate objects using color-color 
and color-magnitude diagrams. In addition, to understand the evolution of star formation in W49, we analyzed 
the distribution of YSOs in the region to identify clusters using a minimal spanning tree method. The fraction 
of YSOs that belong to clusters with >7 members is found to be 52% for a cutoff distance of 96", and the 
ratio of Class II/I objects is 2.1. We compared the W49 region to the G305 and G333 star-forming regions and 
concluded that the W49 has the richest population, with seven subclusters of YSOs. 

Subject headings: infrared; stars — stars: early-type — stars: formation — stars: pre-main sequence 


1. INTRODUCTION 

It is generally accepted that approximately 70-90% of stars 
form in gr oups and clusters embedded in collapsing molecu¬ 
lar clouds (iLada & Ladal120031: iBressert et al.ll201^ . Obser¬ 
vations of nearby star-forming regions have revealed how a 
low-mass star can form in an isolated environment. However, 
massive stars play a vital role in the star formation process in 
embedded clusters, yet their own formation and their effects 
on subsequent generations of star formation are not well un¬ 
derstood. Massive stars form relatively quickly compared to 
low-mass stars and reach the main sequence still embedded 
in their natal clump. The short duration of this phase means 
that these objects are relatively rare and thus typically more 
distant, making them difficult to study. In addition, the em¬ 
bedded clusters hosting those massive stars are hidden inside 
their natal molecular cloud and can only be observed at in¬ 
frared (IR) and millimeter wavelengths. 

The launch of the Spitzer Space Telescope in 2004 
(IWerner et al.l l2004^ has had a big impact on our under¬ 
standing of the star formation process. Many nearby star¬ 
forming regions (e.g. Taur us complex, M16, M17, NGC 
6334) have been stud ied (iTorres k al.l 120091: iKuhn et aTI 
1201 3t IWillis et^l2013h . and many methods have been sug¬ 
gested and used to classify the young stellar objects (YSOs) 
throu gh their spectral indices (iLadal 119871 iRobitaille et al.l 
|2006^ or color-color diagrams (iGutermuth et al.l2008ll200^ 
The combination of Spitzer-lvifraxtd Array Camera (IRAC) 
(I Allen et al.l l2004t lEazio et ^ l2004t IGutermuth et al.l l2008t) 
and near-IR data has been a powerful tool to identify and clas¬ 
sify YSOs. 
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We performed a detailed inyestigation of the W49 star¬ 
forming region, which is located within one of the most 
massiye giant mol ecular clouds (GMCs) (Mgas ~ 10® Mq; 
ISimon et aH 1200 Ih in the Galaxy and hosts many mas¬ 
siye protostars and clusters of young stars that are forming 
around them. Since it is not possible to obserye a single 
YSO at yarious stages of its eyolution, we haye to obserye 
large numbers of Y SOs to study the pre-main-sequence eyo¬ 
lution statistically (iLada & AdamsI 119921 lAllen et ^ 120041 
IGutermuth et al.ll2009h. 

The W49 star-for ming region was discoyered as a radio 
continuum source by lWesterhoutI (|1958^ and lies in the Galac¬ 
tic plane (/,h = 43.°l,0.°0). The W49 GMC exten ds oyer more 
than 10 0 pc, assuming a distance of 11.1 kpc (IZhang et al.l 
120131) . iGalyan-Madrid et ^ (120131) deriyed a total mass 
Afgas "^1.1x10® Mq within a radius 60 pc and Mgas ^ 2 x 10® 
Mq within 6 pc from multiscale obseryations of CO. They 
concluded that the mass reseryoir of the molecular cloud is 
sufficient to form seyeral massiye star clusters, or a small sys¬ 
tem of smaller, but still bound, clusters. 

The W49 complex consists of two main components; a ther¬ 
mal source (W49A), which is a star-forming region, and a 
nonth ermal source (W49B ), identified as a supernoya rem¬ 
nant (iMezger et al.l Il967h . W49A is one of the most lu¬ 
minous star-forming regions in the Milky Way {Lu„\ = 10^'^ 


Lrr^; iSieyers et al.l 

1991 

, with hundreds of candidate OB 

stars (IDreher et al.l 

1984 

IDickel & Gossll 19901 IDe Free et al.l 


l2000t lAlyes & Homeien 120031) . and consists of the star¬ 
forming regions W49 north (W49N/G043.163-0.01), W49 
south (W49S), and W49 southwest (W49SW). W49N also 
hosts the most luminous wa ter maser in the Ga l axy, at a dis- 
tance of d= 11.4 ± 1.2 kpc (iGwinn et al.lll99^ . IZhang etTH 
(120131) redetermined this distance as 11. ® kpc by studying 

the water masers named G043.163-0.01 and G048.603-0.02. 
W49A hosts a ring of a dozen O stars and 25-30 ultracompact 
H II (UCHII) regions dPreher et al.l 1 1981 IWelch et al.l[T9^ 
iDickel & Gossl[l9^ iDe Free et al.lll99% . which may repre¬ 
sent a massiye s tar cluster. Its mass has b een estima ted at Mc\ 
> 4 xlO"' Mq (iHomeier & Alyesll2005l) . Recently. I Wu et all 
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(1201 4ft discovered a very massive 02-3.5 spectral type star in 
the central cluster of W49A. 

Several authors ha ve tried to explain t he nature of the star 
formation in W49A. IWelch et'dl (1 1987ft postulated a large- 
scale gravitational collapse toward the central ring of hyper¬ 
compact (HC) H II regions (Welch ring) in W49N, based on 
their observations of molecular lines that exhibit a d ouble- 
pea ked profile. On the other hand, ISerabvn et al.l (11993ft 
and iBucklev & Ward-Thomnsonl (119961) conclude that this 
double-peak line profile comes from different clouds and sug¬ 
gest that a cloud-clo ud collision is trigge ring the massive star 
formation in W49A. IWilner et al.l (12001ft found hot cores in 
the Welch ring, which are probably the precursors of UCHII 
regions. lAlves & Homeierl (12003ft identified four massive 
stellar clusters based on the spatial distributions of the de¬ 
tected sources based on their (H-Ks) colors, and they hypothe¬ 
size that the W49 GMC collapsed to form the central massive 
cluster, and stellar winds and UV radiation triggered the sur¬ 
rounding the cloud to form the Welch ring. However, they 
concluded that there is no evidence of triggering for the other 
clusters on the so uth and east parts of the region. In addition, 
iPeng et aTI (1201 Oft suggested that the triggering in the region is 
caused by expanding shells in the center of W49N. Recently, 
iGMvan-Madrid et'ffl (1201 3ft studied the mass distribution in 
the whole GMC and concluded that it shows a hierarchical 
network of filaments at scales from ~ 10 to 100 pc and sug¬ 
gested that the W49A starburst is formed from global gravi¬ 
tational contraction. They also concluded the feedback from 
the central young massive cluster is still not sufficiently strong 
to disrupt the GMC, and there is no evidence for significant 
disruption from photoionization. 

Here we present Spitzer IRAC imaging and photometric 
analysis of the W49 star-forming complex with deep IR data 
from 1 to 24 pm to investigate the massive YSOs (MYSOs) 
and embedded clusters forming around them. In Section|2]we 
describe the observations, data reduction techniques, and our 
near- and mid-IR source catalog and YSO classification. In 
Section |3] we present the clustering analysis, in Section 0] we 
present the SED fitting results for massive YSO candidates, 
in Section |5] we present the massive star formation tracers in 
the region, and in Section|6]we discuss the star formation his¬ 
tory in W49 and compare it to other star-forming regions such 
as G305 and G333. Finally, in Section|2]we summarize our 
results and describe our future work. 

2. OBSERVATIONS AND METHODS 
2.1. IRAC Imaging 

We assembled the mid-infrared Spitzer/IRAC observations 
of the W49 region obtained with the Spitzer IRAC instrument 
(iFazio et al.l 12004ft at 3.6, 4.5, 5.8, and 8.0 pm. We list the 
dates and coordinates of each Astronomical Observation Re¬ 
quest (AOR) in Table [T] The data are from several projects, 
including the following project IDs: 631 (PI: G. Fazio), 63 (J. 
Houck), 187 (GFIMPSE; E. Churchwell), and 80074 (Deep 
GFIMPSE; B. Whitney). The data set contained a total of 
11,592 images with a frame time of 2 s. Also, we used a 
total of 1088 images acquired in 12 s High Dynamic Range 
(HDR) mode, which performs consecutive individual obser¬ 
vations with exposure times of 0.4 and 10.4 s. Before per¬ 
forming the point-source detection and photometry, all the 
IRAC images were processed on an image-lw-image basis us¬ 
ing the routine imcleanQ which is an IRAFQ program for re¬ 


Table 1 

Astronomical Observation Requests 


AORKEY 

Date 

(UT) 

R. A. (J2000) 
{h m s) 

Decl. (12000) 

(O , //) 

IRAC 
Reduction 
Pipeline ver. 

7283968 

2003 Oct 02 

19;06;18 

8:17:36 

S18.25.0 

4389888 

2004 Apr 20 

19:10:14 

9:06:16 

S18.25.0 

11963904 

2004 Oct 09 

19:11:04 

9:27:41 

S18.25.0 

11972096 

2004 Oct 09 

19:10:21 

9:07:30 

S18.25.0 

11973376 

2004 Oct 09 

19:11:47 

9:47:52 

S18.25.0 

11966976 

2004 Oct 09 

19:09:39 

8:47:21 

S18.25.0 

11971072 

2004 Oct 09 

19:08:56 

8:27:04 

S18.25.0 

11968768 

2004 Oct 09 

19:12:30 

10:08:04 

S18.25.0 

11972864 

2004 Oct 09 

19:13:13 

10:28:15 

S18.25.0 

11018240 

2005 May 06 

19:11:09 

9:06:24 

S18.25.0 

11017984 

2005 May 06 

19:11:09 

9:06:24 

S18.25.0 

45934848 

2012 Nov 01 

19:06:24 

10:12:04 

S19.1.0 

45928960 

2012 Nov 26 

19:02:31 

8:20:54 

S19.1.0 

45949952 

2012 Nov 28 

19:03:00 

8:34:49 

S19.1.0 

45914880 

2012 Nov 28 

19:03:29 

8:48:43 

S19.1.0 

45909248 

2012 Nov 29 

19:03:58 

9:02:37 

S19.1.0 

45940224 

2012 Nov 30 

19:05:25 

9:44:14 

S19.1.0 

45923328 

2012 Nov 30 

19:04:56 

9:30:22 

S19.1.0 

45917184 

2012 Nov 30 

19:04:27 

9:16:30 

S19.1.0 

45945600 

2012 Nov 30 

19:09:54 

10:28:25 

S19.1.0 

45906176 

2012 Dec 01 

19:05:55 

9:58:09 

S19.1.0 


moving the bri ght source artifacts (“pulldown,” “m uxbleed,” 
and “banding’ ’: lHora et al.ll2004HPipher et al.ll2004h from the 
Basic Calibrated Data (BCD) images. 

Automated source detection and aperture photometry were 
carrie d out using PhotVis version 1.10 (iGutermuth et al.l 
12008ft . PhotVis utilizes a modified DAOnhot (iStetsonll 1987ft 
source-finding algorithm. Aperture photometry was per¬ 
formed with an aperture of 2.4" radius and using a back¬ 
ground annulus of inner and outer radii 2.4", and 7.2" respec¬ 
tively. Within an area of size A/ x Ab = l.°68 x l.°64, cen¬ 
tered at (/, b) = (43?2, -0?02), 332,442 sources were detected 
with IRAC photometry, and among these, 57,254 sources 
have photometry in all four IRAC filters. The photometric 
catalog is available in the electronic edition of this paper. 

The individual BCD images (processed with the Spitzer 
IRAC pipeline version S18.25.0 and S19.1.0) were mo- 
saicked into a large r image using the IRACproc package 
(ISchuster et al.ll200CT . IRACproc is a PDF script based on 
the Spitz er Science Center’s post -BCD processing software 
MOPEX (iMakovoz & Kharill2005ft . which has been enhanced 
for better cosmic-ray rejection. The full mosaic corresponds 
to a region with a size of approximately 360 by 360 parsecs 
at a distance of 11.1 kpc. The 5.8 pm grayscale image of the 
entire mosaic can be seen in Figure [T] and a color image of the 
GMC with its surroundings can be seen in the left panel of 
Figure 12 We show the central region W49A in detail and the 
supernova remnant W49B in the right panel of Figure|2] 

2.2. Completeness Estimate 

We estimated the completeness in each IRAC channel by 
adding artificial sources of various magnitudes to a sample re¬ 
gion (centered on 19:10:22, -(-9:07:35, l.°5xl.°5 in size). The 
artificial sources were added to the image using the observed 
IRAC point-spread function, scaled to the various magnitudes 
and added in a grid of positions in the image. We used the 


' IRAF is distributed by the National Optical Astronomical Observatories, 
operated by the Association of the Universities for Research in Astronomy, 
See http://irsa.ipac.caltech.edu/data/SPITZER/docs/dataanalysistools/tools/contfibutadftkicdonflemtive agreement with the National Science Foundation. 
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Figure 1. 5.8 /im grayscale image of the entire field analyzed in this study. Fields shown by the dashed boxes are where we have the deeper HDR images. The 
image is centered aX I, b = 43 P 2, -OP 02. 


same source-finding and photometry routines previously de¬ 
scribed to extract magnitudes for sources in the field. We then 
compared the result to the input data to determine the com¬ 
pleteness percentage and determined the photometric error for 
the artificial sources. The results of the completeness and er¬ 
ror estimates are shown in Figure [3 The 90% completeness 
magnitudes for IRAC channels 1, 2, 3, and 4 are 15.1, 14.7, 
12.35, and 12.12, respectively. The systematic error of the 
photometry was determined by the median value of the differ¬ 
ence between the calculated photometric measurement of the 
point-spread function (PSF) source and its scaled magnitude 
and is plotted as a function of magnitude for each of the bands 
in Figure[3 Figure|4]shows the photometric errors as reported 
by the Phot Vis photometry routine for the catalog sources. It 
can be seen in the 5.8 and 8.0 /im photometry error plots that 
the 12 s data provide smaller photometric errors, especially 
for magnitudes fainter than 12 mag. 

2.3. Source Catalog 

The source matching for the final catalog was performed in 
stages. First, all sources detected by Spitzer were matched 


to T wo Micron All Sky S urvey (2MASS) Point Source Cata¬ 
log (ISkrutskie et al.ll200^ sources by the PhotVis photometry 
routine. Since we have catalogs for mosaics based on short (2 
s) and long (12 s) frames, we ran a band-merging process and 
took all sources from the short frames brighter than a certain 
cutoff (magnitude 9.6, 9.4, 7.5, and 7.2 for channels 1, 2, 3, 
and 4, respectively), which was chosen to be near but below 
the saturation limit of the 12 s frames. 

Deep near-IR data are necessary, along with the IRAC 
photometry, in order to identify YSO candidates from other 
highly reddened objects and to more accurately fit models of 
their SEDs. Therefore, we used the UKIRT Infrared Deep 
Sky_Survey^R7PLUS (UKIDSSfl Galactic Plane Survey 
(iLucas et alJ2008h data, which are deeper and have better spa¬ 
tial resolution than those of 2MASS. We mer ged these data 
into our catalog using the TOPCAT software lTavlodl2005h 
pair match method with a maximum 1" radial tolerance. We 

* UKIDSS uses the UKIRT Wide Field Camera ICasali et alJ2007l) on the 
United Kingdom Infra red Telescope, and the UKIDSS project is defined by 
ILawrence et alJ <20071) . 
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Figure 2. The left panel shows a color image of the entire mosaic in the Spitzer IRAC bands (blue: 3.6 pm, green: 4.5 pm, red: 8.0 pm). The dashed line 
denotes the Galactic plane. The field shown by the dashed gray box is where we performed the clustering analysis, which is shown in detail in the right panel. 
The size scales in pc assume a distance to W49 of 11.1 kpc. 



MAG 



Figure 3. Top: estimate of the completeness of the IRAC data as a func¬ 
tion of magnitude for the survey. Bottom: error estimate as a function of 
magnitude, derived from the completeness analysis. 


adopted UKIDSS data, where they exist, for sources with 
2MASS magnitudes fainter than the UKIDSS saturation lim¬ 
its, and 2MASS data otherwise (the UKIDSS cameras saturate 
near 12.65, 12.5, and 12 mag in J, H, and Ks, respectively). 

UKIDSS data have small uncertainties and small but mea- 




w 


u d 
0 


Figure 4. Uncertainty in magnitudes versus magnitude in the source catalog 
for the IRAC 3.6 (top left), 4.5 (top right), 5.8 (bottom left) and 8.0 pm 
(bottom right) bands. Sources with errors above 0.3 mag are not plotted. 


surable zero-point photometric offsets from 2MASS. There¬ 
fore, we calculated the mean and standard deviations of the 
magnitude residuals between 2MASS and UKIDSS and ap¬ 
plied a mean offset to UKIDSS data of 0.02, -0.09, and - 
0.04 mag in J, H, and K^, respectively, to place them on the 
same system as the 2MASS photometry. Also, we found that 
the small photometric errors reported in the UKIDSS cata¬ 
log (e.g., <0.001 mag for 13 mag sources) are probably un¬ 
realistic and have a detrimental effect on the source classifi¬ 
cation process. We therefore also adjusted the UKIDSS er¬ 
rors by adding 0.02 mag in quadrature to the values from 
the UKIDSS catalog before the source classification process. 
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Table 2 

Summary of Source Catalog 


Band 

Number of Sources 

J 

305,584 

H 

321,973 

K, 

322,768 

3.6 fim 

310,485 

4.5 /im 

280,892 

5.8 /im 

93,896 

8.0 fim 

63,097 

24.0 fim 

2,268 


Table 3 

Source-matching results 


Catalog A 

Catalog B 

^'match 

N. matches 

IRAC 

2MASS 

1.0" 

221,104 

IRAC-I-2MASS 

UKIDSS 

1.0" 

322,778 

IRAC-i-(2MASSAJKIDSS) 

MIPS 

2.0" 

2268 


This imposes an error floor of 0.02 mag but does not affect the 
larger errors. We also used the MIPSGAL Archive 24 pm data 
(iGutermuth & Hev^l2015h and added them to our catalog by 
using the TOPCAT software pair match method with a max¬ 
imum 2" radial tolerance (since the resolution of the MIPS 
24 pm band is 6" FWHM, the position errors can be higher 
than IRAC). Also, we eliminated the bad matches when they 
are not i dentified as point sources in MIPS image (see Sec¬ 
tion IJA]). Our final catalog therefore contains photometry of 
sources over a wavelength range from 1.2 to 24 pm. A sum¬ 
mary of the source catalog and source-matching results can 
be found in Table |2] and in Table [3] respectively. The source 
catalog itself is presented in Table |5] 

2.4. YSO Classification 

In this study we used the selection method based on 
the co lor and magnitude criteria defined in IGutermuth et all 
(l2009h . This method uses flux ratios or colors to identify 
YSO candid ates. This method is an updated version of t hat in- 
troduced by IGutermuth et ahl (120081) . IGutermuth et al.l (120081 
|2009^ showed that the /rs-[3.6] vs. [3.6]-[4.5] color-color 
diagram is one of the best diagnostics for identifying and 
classifying YSOs. The J and H bands allow us to further 
extend the SED and sort out YSO candidates from highly 
reddened main-sequence stars and other background objects. 
With the near-IR and Spitzer bands, we can quickly iden¬ 
tify YSO candidates and determine their evolutionary status. 
We classify the YSOs into the categories of Class I sources 
(protostars with circumstellar disks and infalling envelopes) 
and Class II sources (pre-main-sequence stars with optically 
thick disks). In addition to these classes, “deeply embedded 
sources,” which are Class I sources with bright emission at 
24 pm, and “transition disks,” which are Class II sources with 
significant dust clearing within their disks, can be identified 
with this method. 

To identify the YS Os in W49, we first applied the 
IGutermuth et al.l (I2009^ criteria to the 332,442 point sources 
in our catalog. First, we separate out extragalactic contami¬ 
nants such as star-forming galaxies, broad-line active galactic 
nuclei (AGNs), and polycylic aromatic hydrocarbon (PAH) 
rich galaxies. Following Gutermuth’s criteria, star-forming 


galaxies are class ified from very red 5.8 and 8.0 pm colors 
dStern et ^120051) because of their strong PAH feature emis¬ 
sion. The [4.5]-[5.8] versus [5.8]-[8.0] and [3.6]-[5.8] 
versus [4.5]-[8.0] color-magnitude diagrams and a combi¬ 
nation of 8 different conditions based on all four IRAC bands 
are used to identify the star-forming galaxies and PAH galax¬ 
ies. Broad-line AGNs have mid-IR colo rs that are largely 
consistent with YSOs (IStern et al.l 120051) . Therefore, the 
AGNs are classified by using the [4.5] versus [4.5]-[8.0] 
color-magnitude diagram and a combination of six different 
conditions involving these two IRAC bands. 

We remove knots of shock emission and sources contami¬ 
nated by PAH emission using the [4.5] - [5.8] versus [3.6] - 

[4.5] color-color diagram and seven different conditions in¬ 
volving these three IRAC bands. Class I YSOs are identified 
using the same diagram and a combination of two conditions 
that show their discriminant colors. Finally, we extract the 
Class II YSOs from the remaining objects in the catalog us¬ 
ing the [4.5]-[8.0] versus [3.6]-[5.8] color-color diagram 
and a combination of four different conditions involving all 
four IRAC bands. 

With the application of each step of the full Gutermuth con¬ 
taminant object identification criteria, we selected a total of 4 
candidate broad-line AGNs, 53 PAH-rich galaxies, 11 knots 
of shocked gas emission, and 639 PAH-contaminated aper¬ 
tures, which are shown in Figure |5] 

After removing al l these contaminants, the second step in 
the IGutermuth et al.l (120091) method is to classify the YSOs 
with near-infrared data, and the third step is to reexamine the 
entire catalog with MIPS 24 pm photometry. After we applied 
the full set of criteria, we ended with 73 deeply embedded 
near-infrared objects, 271 Class I YSOs, 3021 Class II YSOs, 
and 231 transitional disk candidates. 

In our analysis of the W49 data we had to take into ac¬ 
count its greater distance and the possible higher level of 
contamination from objects such as AGB stars, background 
sources, and other extragalactic contaminants that have prob¬ 
ably been classified as YSOs. Since A GB stars are bright and , 
in general, slightly bluer than YSOs (iRobitaille et al J 120081 : 
iKoenig & Feisawita 120141) . we considered the bright YSOs 
that also follow the following selection criteria as candidate 
AGB stars: 

3 < [3.6] < 9.5 and 0.2 < [3.6] - [4.5] < 1.25 
or 

3.5 < [3.6] < 9.5 and 0.4 < [3.6] - [8.0] < 2.6 

The large cluster of objects at bright magnitudes in 

[3.6] - [4.5] versus [3.6] and [3.6] - [8.0] versus [3.6] 

color-magnitude diagrams are consistent with being AGB 
stars. With this criterion we classified 212 sources as can¬ 
didate AGB stars. _ 

The IGutermuth et alJ (120091) criteria require detections at 
5.8 or 8.0 pm to identify extragalactic and background con¬ 
taminants, and we have only 57,254 sources detected in all 
four IRAC bands. Therefore, for the rest of the sources that 
have been classified as YSO candidates in Phase II and have 
only IRAC 3.6 and 4.5 pm photometry, we had to think about 
possible extragalactic contaminants. Thus, we applied a se¬ 
lection cut of [3.6] = 13 mag to separate YSO candidates 
from potential background/foreground objects. However, at 
the distance to W49 that would also remove up to 90% of the 
low-mass YSOs that we are otherwise sensitive enough to de¬ 
tect. We report these additional YSO candidates separately to 
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Figure 5. Co lor—color and color-mag nitude diagrams used to identify contaminant objects among the sources with detection at all four IRAC bands following 
the criteria in IGutermuth et all <200911 . The background logarithmic gray scale indicates the overall source density in each color—color and color—magnitude 
space. In panels (a) and (b), PAH galaxies are marked with green circles. In panel (c), candidate AGNs are marked by red circles. Panel (d) shows knots of 
shocked emission (orange circles) and PAH-contaminated sources (blue circles). 

denote the lower confidence in their identification but do not 
remove them from our catalog. With this criterion we flagged 
64 Class I, 1932 Class II, and 75 transition disk candidates 
with a “uc” mark in our catalog, as shown in Table |5] and 
called them faint YSO candidates. These faint YSO candidate 
sources will not be used in the analysis described in Section 

El 

As a final step, we examined the regions near the YSO 
candidate objects and eliminated 74 transition disk and 27 
embedded source candidates if they did not appear as point 
sources in the MIPS 24 pm images, seem like an artifact 
around a very bright source, or are significantly more ex¬ 
tended than the FWHM for the 24 pm PSF. 

The color-color diagrams of the identified YSOs are shown 
in Figure |6] and the color-color diagrams combining MIPS 
and IRAC photometry and eliminated AGB star candidates 
and faint YSOs are shown in FigurelT] The points are plotted 
without dereddening their photometry. 

We also determined the slope of the SEDs in the mid-IR and 


examined the resulting class distribution of YSOs. The class 
identification for t he slope of lo g(AFy) vs. log(A) between 2 
and ^ 20 pm from iLadaf (1 198 7h is 

a > 0.3 Class I 
0.3 > a > -0.3 Flat Spectrum 
-0.3 > a > -1.6 Class II 
-1.6>a>-2.7 Class III 

In Figure [8] we show the identified YSO candidates with 
the distribution of their SED spectral index. The embed¬ 
ded sources have mainly positive spectral slopes, similar to 
Class I YSOs. Most of the IRAC-identified Class I YSO can¬ 
didates are found to have positive spectral index, indicating 
substantial infrared excess and confirming their identification 
as early stage YSOs. The IRAC-identified Class II YSOs are 
mainly grouped in the CII region as shown in EigurelH consis- 
tent with the classification according to the IGutermuth et alJ 
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-10123 

[4.5]-[5.8] 


-10123 

[5.8]-[B.O] 


Figure 6. IRAC color-color diagrams used for identifying YSO candicates in W49. Left: [3.6] - [4.5] vs. [4.5] - [5.8]; right: [4.5] - [5.8] vs. [5.8]-[8.0]. In 
both panels, red points are Class I, green points are Class II, blue points are transition disk candidates. 



[8.0]-[24] 



0 2 4 6 


[3.6]-[8.0] 


Figure 7. IRAC and MIPS color-color diagrams used for eliminating the AGB stars and faint objects that could be background/foreground contamination. Left: 
[3.6] —[5.8] vs. [8.0]-[24]; right: [3.6] vs. [3.6] —[8.0]. The red points are Class I; green points are class II; blue points are transition disk candidates; black 
points are class III and photospheres; gray points are AGB star candidates; and orange points are faint YSO candidates. 


(120091) method. The slope distribution for transition disk can¬ 
didates agrees with these sources being evolved YSOs. 

The full list of objects in Table|5]includes their classification 
(if available) and SED spectral index. In Table |4] we summa¬ 
rize the final source classification results. 

2.5. Comparison with WISE 

We assembled the WI SE data for the W 49 region from the 
AllWISE Data Release (iCutri et al.ll2013l) and used the YSO 
selection me thod that is based on color and magnitude crite¬ 
ria defined in iKoenig & Leisawitzl (120141) . We compared these 
data with the Spitzer catalog using the source positions, con¬ 
sidering objects a match if the distance between the WISE and 
Spitzer sources was less than 6". Nearly half of the IRAC 
YSOs did not have any match in the WISE point-source cata¬ 
log. Most of the IRAC YSOs that did not have correspond¬ 
ing entries in the WISE catalog are faint or unresolved in 
WISE. This includes the central portion of W49, where the 
source density is too high given the resolution of WISE to 
allow point-source detection among the bright nebular back¬ 
grounds. In the small number of overlapping sources, there 


are no obvious trends that explain all of the differences in clas¬ 
sification. Overall, WISE channel 1 measurements are a bit 
dimmer for all the YSOs than the IRAC channel 1 measure¬ 
ments. The IRAC channel 2 vs. WISE channel 2 has a lot of 
scatter (differences up to about 0.5 mag), but they look more 
centered around 0. The correspondence between the longer 
WISE and Spitzer wavelengths is not as close, but the YSOs 
appear much brighter at long wavelengths according to WISE 
than the Spitzer values. This effect could be due to the larger 
WISE beam and other sources or extended emission being in¬ 
cluded in the WISE photometry of the YSOs. 

We found that a total of 47 WISE YSOs (25 Class I and 
22 Class II) match to IRAC point sources. However, only 20 
of them match to the IRAC YSOs within 50" of the center of 
W49, shown in Eigure|9l Within this group of 20 WISE YSOs, 
12 Class I candidates match IRAC Class I candidates, 1 WISE 
Class I candidate matches an IRAC Class II candidate, 5 WISE 
Class II candidates match IRAC Class II candidates, and 2 
WISE Class II candidates match IRAC Class I candidates. The 
rest of the WISE YSO candidates (27 sources) match to IRAC 
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SED a 

Figure 8. Distribution of the SED spectral index a for sources identified as 
YSO candidates. The three vertical lines mark the division between the YSO 
regions based on their SED slopes. From left to right: Class II, flat spectrum, 
and Class I. 


Table 4 

Source Classification Summary 



Figure 9. IRAC and WISE YSO candidate distribution overlaid on the IRAC 
8 fim image. The IRAC YSOs are shown with the filled circles, with red 
for Class OA, green for Class II, blue for transition disks. The WISE YSO 
candidates are plotted as boxes, with red for WISE Class I and green for 
WISE Class IT 

dates in our further analysis. This conclusion may apply to 
other regions that are similarly distant or have a high density 
of sources. 


Class 

Number of Objects 

Class I 

186 

Class II 

907 

Transition disks 

74 

Embedded protostars 

46 

Faint Class T 

64 

Faint Class iT 

1932 

Faint transition disk 

75 

Photospheres 

247,142 

AGB candidates 

212 

Other ** 

705 

Unclassified 

81,097 

Total 

332,442 


These are the YSO candidates with lower confi¬ 
dence. 


Includes PAH-emission dominated sources, 
shocked-gas-emission-dominated sources, broad¬ 
line AGN candidates, and PAH galaxy candidates. 
Unclassified sources lack detection in four bands 
(either HKs, IRAC 1 and 2, or IRAC I, 2, 3 and 
4) or a bright MIPS 24 detection. 


AGB stars, photospheres, or PAH-contaminated source can¬ 
didates. The list of WISE YSOs that match with IRAC YSOs 
is given in Table |6] and the sources are shown on an 8.0 fim 
grayscale image in Figure |9] 

The source classification with WISE data indicates that the 
high density of sources in the central region, saturation, and 
source confusion prevent us from identifying YSO candi¬ 
dates. Differences in classification may also originate from 
the inaccurate photometry and lead us to identify some con¬ 
taminant sources as YSO candidates with the WISE data. Be¬ 
cause of these effects, we conclude that the WISE data are not 
very useful at identifying YSO candidates in the central part 
of W49, and therefore we did not include WISE YSO candi- 





















Table 5 
Source List 


Source Name^ 

R.A. 

J2000.0 

(°) 

Decl. 

J2000.0 

(°) 

J 

(mag) 

H 

(mag) 

K, 

(mag) 

[3.6] 

(mag) 

[4.5] 

(mag) 

[5.8] 

(mag) 

[8.0] 

(mag) 

[24] 

(mag) 

Type'’ 

Phase‘s 

Elag** 

SED 
slope a 

SSTOERC 

G043.0997-00.0296 

287.56134 

9.02556 

19.74(15) 

17.04(03) 

15.54(03) 


12.27(08) 



4.04(10) 

0 

0 


1.51 

SSTOERC 

G042.6828+00.2115 

287.15012 

8.76694 

17.13(02) 

16.03(02) 

14.90(02) 

11.56(02) 

10.37(01) 

9.04(02) 

8.18(02) 


1 

1 


1.88 

SSTOERC 

G042.2532+00.1846 

286.97418 

8.37315 

16.44(02) 

14.80(02) 

14.08(02) 

13.03(05) 

12.44(05) 

11.70(09) 

10.95(11) 

7.5(09) 

2 

1 


-0.47 

SSTOERC 

G042.8060+00.6517 

286.81198 

9.07888 

13.95(02) 

13.08(02) 

12.68(02) 

12.35(03) 

12.21(02) 

11.91(10) 

11.31(11) 

8.30(09) 

3 

1 


-1.72 

SSTOERC 

G043.2857-00.5262 

288.09421 

8.96086 

18.83(06) 

17.12(04) 

16.10(03) 

14.62(14) 

13.71(09) 




1 

3 

UC 

0.20 

SSTOERC 

G043.1518-00.9278 

288.39154 

8.65611 

15.53(02) 

14.94(02) 

14.31(02) 

13.93(06) 

13.20(18) 




2 

3 

uc 

-1.93 

SSTOERC 

G042.4240+00.1903 

287.04852 

8.52740 

15.81(02) 

14.59(02) 

13.96(02) 

13.35(07) 

13.36(09) 



8.27(10) 

3 

3 

uc 

-0.87 

SSTOERC 

G042.5060+00.5319 

286.78003 

8.75746 

17.55(01) 

15.56(02) 

14.67(02) 

13.84(03) 

13.91(03) 

13.44(11) 



99 

1 



SSTOERC 

G042.7282+00.4616 

286.94656 

8.92236 


18.70(14) 

17.30(09) 

14.55(13) 

13.64(08) 

12.40(13) 

11.55(12) 


29 

0 



SSTOERC 

G042.9654+00.6453 

286.89200 

9.21747 

15.85(02) 

13.88(02) 

13.02(02) 

12.33(01) 

12.43(02) 

10.51(05) 

12.18(13) 


19 

0 



SSTOERC 

G042.8351+00.3941 

287.05713 

8.98614 

5.62(02) 

5.09(05) 

4.88(03) 

6.64(01) 

4.93(00) 

4.90(00) 

4.85(00) 


9 

0 



SSTOERC 

G042.8251+00.6911 

286.78540 

9.11397 

14.91(02) 

12.76(02) 

11.83(02) 

11.24(01) 

11.43(01) 

11.03(05) 

11.04(06) 


20 

0 



SSTOERC 

G043.1583+00.8687 

286.78085 

9.49150 

17.29(03) 

13.29(02) 

10.26(02) 

6.48(00) 

5.75(01) 

3.87(00) 



12 

3 



SSTOERC 

G042.8033+00.6849 

286.78085 

9.09181 

17.43(03) 

16.15(03) 

15.60(03) 

14.79(12) 

14.28(12) 




4 

3 




Note. — Table 5 is published in the electronic edition of the Astrophysical Journal. A poition is shown here for guidance regarding its form and content. Values in parentheses signify eiTor in last 2 digits of magnitude value. 
Right ascension and Declination coordinates are J2000.0. 

^ Sources are named according to their Galactic longitude and latitude with the prefix SSTOERC, referring to Spitzer Space Telescope, Origin and Evolution of Rich Clusters project. 

^ 0; deeply embedded, 1; Class I, 2; Class II, 3; transition disks, 99; Class III and photospheres, 9; shocked gas emission, 29; AGNs, 19; PAH galaxies, 20; PAH dominated sources, 12; AGBs, -100; unclassified. 

^ Sources are flagged according to the phase they were classified. L iGutermuth et ^120091) classification Phase I, 2; Phase II, 0; Phase III, 3; our AGBs, faint YSO/background contamination phase (see Section 2.4). 

^ YSO candidates near a bright IRAC source within 35^^ are flagged with a question mark. Faint YSOs which can be background/foreground contamination are flagged with the letter “uc”. 
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3. CLUSTERING ANALYSIS 
3.1. Spatial Distribution ofYSOs 

Figure|9]shows the spatial distribution of the YSOs overlaid 
on the 8.0 p,m grayscale image of the region. The sources are 
colored according to their evolutionary class: blue for transi¬ 
tion disks, green for Class II, and red for Class O/I candidates. 
In the central W49A region, it is clear that Class O/I sources 
and Class II sources are clustered together. In the whole re¬ 
gion, however, the Class O/I sources are clustered, or in hl- 
amentary form, while the Class II sources are more evenly 
distributed. 


3.2. Minimal Spanning Tree (MST) 

To understand the star formation history in W49 and com¬ 
pare W49 to other star-forming regions, we want to identify 
individual groups or clusters ofYSOs and compare their prop¬ 
erties such as sizes and ages. In order to hnd the clusters, w e 
used a method called MST jCartwright & Whitworthll20^ . 
which uses the spatial distribution of sources to determine a 
cluster membership without any kinematic information. 

The MST method dehnes clusters as a collection of stars 
that are connected to each other by branches smaller than 
the cutoff distance (dc) or branch length and with a mini¬ 
mum number of stars (N) in a group. It has been a popular 


forming regions dKoenig et al. 

2008 

IGutermuth et al. 

20091; 

IBeerer et alJl2010l IBillot et al. 

2011 

; IChavarrfa et al.l 



However, there is no unique way to determine a cutoff dis¬ 
tance for a cluster determination. 

The most common way used for nearby regions is to plot the 
distribution of cutoff distances, ht straight lines through the 
long and short cutoff distance domain s, and choose the point 
of intersection (iGutermuth et al.l2008^ . Choosing a cutoff dis¬ 
tance that falls between these two domains is also a good way 
to separate c lusters from distribute d sources (IGutermuth et al.l 
l200d l2009h . With this method, iBeerer et akl (1201 ol) deter- 
mined the cutoff distance as 61" (0.43 pc) for the Diamond 
Ring region in the Cygnus-X star-forming co mplex, which is 
at a di stance of 1450 pc. On the other hand. iGuarcello et alJ 
(1201 3h derived the cutoff distance as 72" (0.51 pc) for the 
Cygnus-OB2 region in the Cygnus-X complex. All these val¬ 
ues are smaller than the 88" (0.86 pc) derived bv iKoenig et alJ 
(l2008h for the W5 star-forming region, which lies at a distance 
of 2 kpc, which may indicate that Cygnus-X is more densely 
p opulated with YSOs tha n W5. 

IChavarrfa et al.l (120141) used the MST method for nearby 
embedded star-forming regions by dividing each region ac¬ 
cording to their YSO concentrations. For W5-east they found 
an average cutoff distance of 33" (0.32 pc), w hich is differ¬ 
ent th an the 88" (0.86 pc) that was found b y iKoenig et al.l 
JMil) for the whole of W5. iBillot et al.l (1201^ used the MST 
metho d according to the dehnition given in (Gutermuth et al.l 
(l2009h for Galactic star-forming regions like W43 and Vu- 
lOBl to derive their clustering properties and found cutoff 
distances of ~80" (corresponding to 4.3 pc at the distance 
of W43) and ^100" (5.4 pc), respectively. These cutoff dis¬ 
tance differences may be partly a resolution effect, since at 
greater distances multiple stars may blend into single objects, 
and fewer stars are detected owing to corresponding lower 
sensitivity in the more distant regions. Low-resolution obser¬ 
vations of distant star-forming regions probe only larger-scale 
structures, and it is hard to see the hierarchical substructure 
in these regions. In addition to Galactic regions, the MST 


method has been used by iBastian et al.l (l2007h for the star¬ 
forming regions in the M3 3 galaxy. In order to prevent an ar- 
tihcial length scale according to the resolution limit and see¬ 
ing a large number of stars as blended or as single sources, 
they used cutoff distances from 5" to 65" (corresponding to 
19 to 252 pc assuming the distance of M33 as 800 kpc) in 13 
equal steps. Examining the clustering at the different values 
allowed them to see all levels of hierarchy in M33. 

3.2.1. W49 Clustering Analysis with MST Method 

To compare with previous clustering analysis in nearby star¬ 
forming regions, we hrst use the straight-line ht method to de¬ 
termine the branch length cutoff for W49. We plot the num¬ 
ber of clustered objects versus branch length. Branch lengths 
between 82" and 130" deviated signihcantly from linearity, 
so we masked these areas and found the intersection to the 
best hts to the long and short branch length regimes. We 
found a cutoff distance of 96", which corresponds to 5.2 pc at 
11.1 k pc, similar to the cutoff distance found by IBillot et aH 
(1201 ll) for W43 and VulOBl. However, this cutoff distance 
shows that the straight-line ht method does not hnd the iso¬ 
lated subclusters, instead picking up almost the whole star¬ 
forming region with a diameter of 27 pc associated with the 
overdensity of YSOs. 

To determine the statistical signihcance of the W49 clusters 
identihed with this cutoff distance, we performed simulations 
to examine the clustering properties of randomly distributed 
objects. We created 1000 distributions of 231 objects spread 
randomly in a 0.5 x 0.5 degree held to match the number of 
YSOs and the size of the central region of W49. We examined 
the number of groups that would occur in a random distribu¬ 
tion as a function of the minimum group size. We chose the 
minimum group size that gave the fewest random groups for 
a given branch length. 

We found that for a cutoff distance of 96" and a minimum 
group size N = 1, our random distribution of YSOs will yield 
on average nine groups or subclusters. We looked at the group 
with the highest number of members in each simulation and 
found that these largest groups had an average of 22±6 mem¬ 
bers, with a maximum of 78. For our observed distribution 
we see a much smaller number of clusters, hnding three con¬ 
centrated groups of YSOs containing 52% of the observed 
sources using a 96 " cutoff distance. The population of the 
largest observed group (Cluster 1, G43.15-0.01) is 97, signif¬ 
icantly higher than the maximum seen in simulated random 
distributions. Cluster 2 (G43.33-0.08) has 15 members, sim¬ 
ilar to the 16±2.9 average size of observed for random distri¬ 
butions of objects. Cluster 3 (G43.31-0.20) has nine mem¬ 
bers, slightly smaller than the 13±2.0 found for the third- 
largest random subcluster. The remaining 110 YSOs (48%) 
did not belong to any of these clusters and are considered to 
be part of the distributed population. 

The identihed clusters in W49 are shown in the left panel of 
Figure[TT]and their properties are summarized in Table|7] The 
distribution of cutoff distances versus the number of sources is 
plotted in the left panel of Figure [TO] To see how the numbers 
of groups change according to the cutoff distance, we plotted 
the number of groups containing seven or more stars versus 
cutoff distance from 1" to 300", with steps of 1", in the right 
panel of Figure ITOl 

This comparison indicates that the largest group observed, 
cluster 1, is likely formed from the extended YSO popula¬ 
tion of W49, representing a large-scale structure extending 
26.86 pc in diameter. The lower-density clusters 2 and 3 oh- 
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Table 6 

WISE YSO Source List 


WISE R.A. 
(°) 

WISE Decl. 

r) 

Wl 

(mag) 

W2 

(mag) 

W3 

(mag) 

W4 WISE 

(mag) Class^ 

Matched IRAC 

Source Name 

IRAC 

Class“ 

287.2152079 

9.2737751 

14.05(21) 

12.36(07) 

8.90(04) 

5.99(06) 

1 

SSTOERC G043.1622+00.3880 

1 

287.4789811 

9.8163322 

12.33(06) 

9.93(03) 

5.45(02) 

2.05(02) 

1 

SSTOERC G043.7637+00.4076 

1 

287.2595223 

9.5982765 

13.05(07) 

11.38(04) 

8.83(07) 

5.70(05) 

1 

SSTOERC G043.4703+00.4988 

1 

286.9496205 

9.3442128 

11.55(09) 

10.27(02) 

6.91(02) 

3.95(02) 

1 

SSTOERC G043.1041+00.6530 

1 

287.8935132 

8.8461252 

10.48(03) 

9.07(02) 

6.72(04) 

4.48(05) 

1 

SSTOERC G043.0923-00.4037 

1 

287.9997854 

9.1612432 

11.48(03) 

9.66(02) 

7.24(04) 

4.84(03) 

1 

SSTOERC G043.4202-00.3508 

1 

287.7786347 

9.6947734 

11.65(05) 

9.5(03) 

5.90(02) 

2.43(02) 

1 

SSTOERC G043.7923+00.0895 

1 

287.9451674 

9.7828376 

12.20(05) 

10.68(03) 

7.48(05) 

3.98(05) 

1 

SSTOERC G043.9462-00.0153 

1 

287.5414301 

8.4239783 

11.63(03) 

10.60(03) 

8.12(05) 

5.34(04) 

2 

SSTOERC G042.5570-00.2899 

1 

287.5407128 

8.6191474 

11.04(03) 

9.32(02) 

6.11(02) 

3.85(03) 

1 

SSTOERC G042.7298-00.1991 

1 

287.3331588 

8.5458962 

10.43(02) 

9.22(02) 

7.06(03) 

5.73(08) 

2 

SSTOERC G042.5701-00.0509 

1 

286.9115172 

8.4531359 

11.19(04) 

9.62(02) 

7.06(02) 

4.80(05) 

1 

SSTOERC G042.2956+00.2765 

1 

287.6502571 

8.6810581 

9.53(02) 

6.61(02) 

4.13(01) 

1.76(02) 

1 

SSTOERC G042.8347-00.2668 

1 

287.8018156 

8.5427571 

9.17(02) 

7.29(02) 

5.20(01) 

3.90(02) 

1 

SSTOERC G042.7814-00.4635 

1 

288.0424475 

8.9366108 

9.34(02) 

8.89(02) 

7.68(03) 

5.34(06) 

2 

SSTOERC G043.2406-00.4921 

2 

287.9274058 

9.8193046 

9.70(02) 

8.85(02) 

6.37(02) 

4.02(03) 

2 

SSTOERC G043.9705+00.0170 

2 

287.9060792 

9.8514046 

11.08(03) 

9.68(02) 

7.47(04) 

4.74(06) 

1 

SSTOERC G043.9892+00.0504 

2 

288.2291642 

9.2840378 

11.59(04) 

10.46(04) 

8.25(05) 

5.49(05) 

2 

SSTOERC G043.6338-00.4946 

2 

287.8718301 

8.4157962 

11.98(03) 

11.13(03) 

8.81(06) 

6.18(06) 

2 

SSTOERC G042.7008-00.5836 

2 

287.9626502 

8.6556335 

12.02(04) 

11.61(04) 

9.01(09) 

5.90(07) 

2 

SSTOERC G042.9550-00.5526 

2 


^ 0: deeply embedded; 1: Class I; 2; Class II candidate sources. 


served with the straight-line fit cutoff distance of 96 " have a 
higher probability of occurring randomly as a result of chance 
alignment of distributed YSOs. 

To investigate the substructure within these clusters, we 
also investigated the number and size distribution of subclus¬ 
ters identified as a function of the cutoff distance used. For 
a random distribution, the number of groups identified in the 
minimum spanning tree is a function of both the minimum 
group size and the cutoff distance. A long cutoff distance and 
small minimum group size will break the MST into a large 
number of small, more dispersed clusters. We examined the 
number and size distributions of clusters that would arise in 
a randomly distributed population of YSOs to find the cut¬ 
off distance and minimum group size that yielded the largest 
number of groups that had a low probability of occurring ran¬ 
domly. 

We found that using a break length of 40" (2.2 pc in W49) 
and requiring A > 6 or more members isolates seven sub¬ 
clusters, a distribution that did not occur in the 1000 simu¬ 
lated randomly distributed trials. Only 8.8% of the random 
trials had at least one group of six or more members, and only 
0.3% of the random YSO distributions had two groups, with 
no trials having more than two random groups. Therefore, 
the random trials indicate that the identified subclusters likely 
represent physically associated groups of YSOs, with a <10% 
likelihood that one of the subclusters could be a random asso¬ 
ciation. Note also that the subclusters identified are all within 
the three clusters identified with the larger break length and 
minimum cluster member number. However, owing to the 
complex line of sight toward W49, without additional data to 
determine the distance to each object, we cannot rule out the 
possibility that all of the clusters may not all be at the same 
distance as the W49 GMC. 

According to the YSO clustering analysis, identified sub¬ 
clusters in W49 have sizes of order a few parsecs in diame¬ 
ter with six to nine sources in each subcluster. On the other 
hand, subclusters identified in the Cygnus-X DR21 region 
have 10-148 sources with diameters of 1-8.1 pc (iBeerer et al.l 
120101) . The same study finds subclusters in the AFGL 2636 re¬ 


gion with diameter s of 1.1 -3.6 pc with 12-149 sources. Simi¬ 
larly, l^enig^erai] (|200^ found subclusters in the W5 region 
with 10-201 sources with diameters of a few parsecs. W49 
has smaller clusters with less members; however, this might 
be a result of incompleteness. It also should be noted that 
these studies used different criteria in their subcluster analy¬ 
ses. To co mpa re W49 to other previously observed regions, 
in Section 16.31 we examine the clustering properties of the 
star-forming regions G305 and G333 with the same method 
used for W49 (after rescaling to account for their different 
distances) and then compare their clustering properties in de¬ 
tail. 

The properties of each W49 subcluster are shown in Ta¬ 
ble 1^ and subclusters are shown in the right panel of Fig- 
ure[TT] All identified groups and clusters from both large and 
small cutoff distances can be seen in Figure [12] and the sum¬ 
mary of the results can be seen in Table |9| with the physical 
scale assuming that the clusters are at 11.1 kpc. 

3.2.2. VSO Densities in Clusters 

Figure [13] shows the YSO nearest-neighbor distance in the 
whole cloud complex, with the YSO candidates in identified 
clusters overlayed on each panel. The YSO nearest-neighbor 
distance grayscale images show the YSO nearest-neighbor 
distance in arcseconds, with white regions being higher den¬ 
sity. Panel (a) shows the plots with the YSO candidate pop¬ 
ulation, and Panel (b) shows the same maps but includes the 
faint YSOs/possible contaminant sources. It can be seen in 
Panel (b) that there is an overdensity of faint YSOs at the po¬ 
sition of W49B, potentially indicating an associated low-mass 
star cluster, although it could be a chance alignment. 

4. SED MODELS OF MYSOS 

We have a pplied an SED fitting method described in 
lAzimlu et al.l (120151) to the 231 YSO candidates in the MST 
clusters in order to identify the most massive candidates in 
our sample and obtain a rough estimate of their properties. 
The most common method to find an SED model is to com¬ 
pare the models to data points, find the value using the 
photometric errors, and pick the model that minimizes the 
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Figure 10. Left: The MST branch length distribution. The straight lines represent the linear fit to the points smaller and larger than the chosen branch length. 
The point of intersection dc = 96", and the minimum number of stars N = 1. Right: Number of groups containing seven or more stars identified by the MST 
algorithm. 


Table 7 

Properties of the Clusters in W49 GMC for dc = 96" 


No. 

Cluster 

Name 

R.A. (J2000.0) 
(°) 

Decl. (J2000.0) 
(°) 

N/r“ 

I 

II 

II/I*’ 


N,/ 

Diameter 

(") 

Diameter 

(pc) 

1 

G43.15-0.01 

287.56496 

9.074502 

97 

31 

57 

1.84(0.41) 

9 

0 

497.88 

26.86 

2 

G43.33-0.08 

287.71438 

9.202276 

15 

2 

10 

5.00(3.87) 

1 

2 

244.17 

13.17 

3 

G43.31-0.20 

287.808112 

9.134392 

9 

2 

7 

3.50(2.81) 

0 

0 

176.38 

9.52 


^ Number of stars with infrared excess. Includes Class I, Class II, deeply embedded protostars, and transition disk candidates. 
^ Number in parentheses indicates Poisson uncertainty in ratio. 

^ Number of deeply embedded protostars. 

^ Number of transition disk candidates. 
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Figure 11. Left: thi'ee MST clusters in the W49 determined using a cutoff distance of 96'' (5.2 pc for objects at the distance of W49). Right: seven MST 
subclusters identified with > 6 and dc = 40" (2.2 pc). Objects in clusters are colored according to their group, while sources not associated with any clusters 
are plotted in gray. 
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Table 8 

Properties of the Clusters in W49 GMC for dc = 40" 


No.'* 

Cluster 

Name 

R.A. (J2000.0) 
(°) 

Decl. (J2000.0) 
(°) 


I 

II 

iiyi* 


N,/ 

Diameter 

(") 

Diameter 

(pc) 

la 

G43.17-0.00 

287.571256 

9.102715 

9 

6 

3 

0.50(0.35) 

0 

0 

70.53 

3.81 

lb 

G43.10-0.04 

287.57085 

9.017707 

8 

3 

3 

1.00(0.82) 

2 

0 

88.15 

4.76 

Ic 

G43.19-0.02 

287.597988 

9.105411 

7 

1 

6 

6.00(6.48) 

0 

0 

34.26 

1.85 

Id 

G43.153-0.03 

287.532889 

9.096616 

7 

1 

6 

6.00(6.48) 

0 

0 

50.86 

2.74 

le 

G43.133-0.06 

287.497549 

9.089443 

6 

1 

3 

3.00(3.46) 

2 

0 

60.49 

3.26 

If 

G43.23-0.05 

287.639709 

9.138874 

6 

2 

4 

2.00(1.73) 

0 

0 

38.57 

2.08 

2a 

G43.31-0.08 

287.705104 

9.185597 

6 

1 

4 

4.00(4.47) 

1 

0 

79.40 

4.28 


^ Subclusters la, lb, Ic, Id, le, and If coirespond to cluster 1; subcluster 2a corresponds to cluster 2. Clusters 1, 2, and 3 are determined by the cutoff 
distance dc = 96" 

^ Number of stars with infrared excess. Includes Class I, Class II, deeply embedded protostars, and transition disk candidates. 

Number in parentheses indicates Poisson uncertainty in ratio. 

^ Number of deeply embedded protostars. 

^ Number of transition disk candidates. 



Figure 12. Three MST clusters identified using dc = 96" are shown with black grou^umbers overlaid on the IRAC 8 /rm image. The seven subgroups identified 
by dc = 40" are shown with a letter and the cluster number they belong to (see Tableland Table[^. Massive YSO candidates {M > 8 Mq) are shown with green 
crosses. Distributed YSO candidates not assigned to groups are plotted with gray points. 
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120.7 121.3 122.1 123.4 125.4 128.6 133.8 142 154.9 175.3 207.7 259.8 341.3 470.3 


Figure 13. YSO nearest-neighbor distance (in arcseconds) in the W49 cloud complex, with the YSO candidates in identified clusters overlayed on each 
panel. Panel (a) shows the YSO nearest neighbor distance in the W49 cloud complex. Panel (b) shows the YSO nearest-neighbor distance including the 
faint YSOs/possible contaminants. The position of the W49B supernova remnant is indicated by the red circle. 
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Figure 14. Positions of UCHII regions jPe Pree et alJ199'7h in the W49 field are shown with magenta diamonds on the IRAC 3.6 /xm image with the associated 
YSOs. Red circle s show the YSO s in subcluster la, and blue circles show the YSOs in subcluster Id. The cyan circle mai'ks the location of the O type 
star discovered bvlWu et al.l j2014h . Green crosses mark the massive YSO ca ndidates, cyan cros ses 89, 90, 91, and 92 mark the massive dust clumps from 
IMatthews et al.l <2Q09l) . and blue crosses mark the methanol masers identified bv IBreen et al.l <20151) . 


This b est-fit method is described in detail by iRobitaille et al.l 

( | 200 ^ . 

With a limited amount of photometric data, in order 
to explore all the possible solutions, we first applied the 


lAzimlu et al.l (12015h method, which uses a Markov Chain 
Monte Carlo method to explore the larger parameter space of 
the model grid. We applied this method to the set of YSO can¬ 
didates and found that 78% have masses < 8 Mq, and 60% of 
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Table 9 

Clusters/Subclusters in W49 


Parameter 

Straight-line 

Fit 

II 

0 

Number of clusters 

3 

7 

Cutoff distance 

96"(5.2 pc) 

40" (2.2 pc) 

Percent in clusters 

52 

21 

Group size/diameter 

176"-498" (9-27 pc) 

34"-88" (1-5 pc) 

Cla.ss II/I Ratio 

2.11(0.43)* 

2.00(0.64)* 


Number in parentheses indicates Poisson uncertainty in ratio. 


the YSO candidates have masses between 2 and 6 M 0 . 

Within the set of MYSOs found to be > 8 M 0 , we selected 
the ones that have at least six photometric data points to limit 
the fitting to the bette r-constrained objects, and we used the 
iRobitaille et alJ (^2007^ online SED fitter to obtain the physical 
parameters from the best fit. This selection also favors the 
brighter and possibly more massive objects in our sample. We 
identified 16 MYSO candidates with this selection method, 
which are shown in Figure [12] with green crosses. 

The SED fitter was used to model the available data 
(2MASS/UKIDSS J, H, and Ks, SpitzerHRAC3.6, 4.5, 5.8, 
and 8.0 /rm, and SpitzerlMWSi 24 pm). We focused on only 
the mass, luminosity, and age parameters; all of the modeled 
parameters are likely to have a high uncertainty without a finer 
sampling of the object’s SED. To provide some indication of 
the model accuracy, we report the results from the average of 
the best 10 fit models and their standard deviation for the YSO 
candidates that have masses larger than 8 M 0 in Table [TOl 

Most of the MYSO candidates are classified as Class I, 
which shows that they are recently formed. MYSO candi¬ 
dates 1 and 16 were also previousl y identified as YSOs in the 
Red MSX Source Survey (RMS; lUrauhart et al.ll20T^ . An 
H II region S at the position of MYSO 5 and another H II re¬ 
gion at the position of MYSO 9 were previously identified in 
the same survey. Within 15" near MYSO 5 there is another 
H II region and a methanol maser, which are both indicating 
that there is ongoing massive star formation (Figure [T4li. 

5. MASSIVE STAR FORMATION TRACERS 

A large fraction of stars and clusters form mostly at the 
peripheries of H II regions, and it has been suggested that 
H II regio ns can enhance and trigger the massive star forma - 
tion (e.g.. lDeharveng et'ani2006ll2008l ; IZavagno et al.ll2006h . 
Since W49 is a massive star-forming complex that hosts many 
UCHII regions, it is expected to have massive star formation 
around those locations. As can be seen in Figure [T4| most 
of the H II regions are located within a 1 pc area around the 
Class I candidate SST OERC G043.1684+ 00.0087, where the 
UCHII region W49 J (iDe Free et al.lll99'^ is located. Four of 
the H II regions seem positionally associated with YSO candi¬ 
dates, including one MYSO candidate. The close correspon¬ 
dence between some of the YSO candidates and the UCHII 
regions may indicate that some of the YSOs may be related 
to the driving object behind these UCHII regions, or at least 
formed in the same cluster. 

Embedded UCHII regions and MYSOs have simi lar mid-IR 
colors and very similar ages (iMottram et al.llMTTl) ; however, 
we do not see more associations, contrary to what we expect 
from these massive star formation tracers. Several of the H II 
regions are very bright in the IRAC images, and a couple are 
saturated in the 8.0 pm band. However, most of the H II re¬ 
gions do not have an associated YSO candidate. In some cases 


we were not able to classify these sources because of the lack 
of detection in at least four of our photometric bands. For 
example, it can be seen in Figure [T4| that many of the HII re¬ 
gions close to W49 J do not have bright mid-IR counterparts, 
and the sources in this region would be difficult to separate 
at this resolution. It also can be seen that many of the IR- 
identified candidate MYSOs do not have associated UCHII or 
dust clump sources, indicating that the sources are perhaps in 
different evolutionary stages. 

In addition to HII regions, methanol masers are also 
known to be associated with high-mass s tar-forming regions 
(iPestalozzi et al.l 120021: iBreen et ^l2013h. The positions of 
the 6.7 GHz methanol masers identified bv iBreen et alJ(l2015l) 
in W49 (G43.149 h-0.013, G43.165h- 0.013, G43.167-0.004, 
G43.17lH-0.005, and G43.175-0.015) are shown in Figured! 
with blue crosses. These methanol masers appear as unclassi¬ 
fied in our catalog according to their lack of photometry; how¬ 
ever, G43.167-0.004 seems positionally associated with H II 
region P and/or one of our Class I YSO candidates which does 
not appear to be an MYSO candidate according to the SED 
modeling. Three of the methanol masers seem also position- 
ally associated with previously identified H II regions, as can 
be seen in Figure [l4l These sources are perhaps in a younger 
stage of evolution where we cannot yet trace them with IR 
photometry. We can conclude that high-resolution observa¬ 
tions in the infrared, submillimeter, and radio are necessary to 
put together a comprehensive picture of the star formation in 
the central region of W49. 

6. DISCUSSION 

6.1. Star Formation in W49 GMC 

lAlves & Homeierl (120031) found four clusters associated 
with radio sources W49S, W49N, S, and Q and hypothesized 
that the GMC collapsed to form the central massive cluster 
(W49N) and stellar winds and UV radiation triggered the for¬ 
mation of the Welch ring. However, they did not find any evi¬ 
dence of triggering for other clusters on the south and east part 
of the region (W49S, S, and Q). Also, they indicate that com¬ 
pact HII regions with short lifetimes can be found over the en¬ 
tire region, providing evidence for multiscaled, largely coeval 
star formation in W49A. In our clustering analysis, we found 
that the large cluster 1 (G43.15-0.01) represents almost the 
whole GMC, including the well-known Welch ring, W49S, 
and clusters S and Q. 

There is significant correspondence b etween our YSO 
subclu sters and the clusters identified by lAlves & Homeied 
(120031) . Subcluster la (G43.17-0.00) corresponds to their 
cluster 1 (“Extended”), while their cluster 2, W49A South, is 
unresolved in the IRAC images and detected as a single YSO 
candidate (SSTOERC G043.1651-00.0285). Their cluster 3, 
corresponding to UCHII region S, is also identified as a sin¬ 
gle high-mass Class I YSO candidate in our data, SSTO¬ 
ERC G043.1518 h- 00.0115. Their cluster 4, corresponding to 
UCHII region Q, is bright in the IRAC images but did not 
meet the YSO selection criteria. 

The other subclusters we have identified extend to the east 
and west of W49A and do not correspond to previously identi¬ 
fied near-infrared stellar clusters. In total we identified seven 
subclusters of YSOs within 30 pc of W49A using the MST 
method. Subclusters la, lb, Ic, Id, le, and If correspond to 
cluster 1 which has a Class II/I ratio of 1.87. Within this clus¬ 
ter, the youngest subcluster la, corresponding to the central 
region of W49A, and subclusters Ic and Id lie near the out- 
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Table 10 

Physical Parameters of Massive YSO Candidates 


No. 

Name 

R.A. 

J2000.0(°) 

Decl. 

J2000.0(°) 

Mass 

(Mq) 

a 

(M0) 

Luminosity 

(lO^L©) 

a 

(IO^Lq) 

Age 

(10®yr) 

cr 

(10'’yr) 

Class 

1 

SSTOERC G043.0772+00.0038 

287.520782 

9.021031 

12.45 

1.29 

12.77 

3.61 

0.84 

0.76 

1 

2 

SSTOERC G043.0942-00.0388 

287.566986 

9.016427 

10.15 

1.99 

3.41 

1.08 

0.11 

0.15 

1 

3 

SSTOERC G043.1017-00.0373 

287.569153 

9.023827 

9.57 

0.87 

5.51 

1.74 

2.25 

0.70 

1 

4 

SSTOERC 0043.1139+00.0169 

287.526184 

9.059647 

7.35 

0.50 

0.74 

0.19 

0.08 

0.03 

2 

5 

SSTOERC G043.1518+00.0115 

287.548737 

9.090796 

18.96 

3.61 

34.37 

23.66 

0.02 

0.02 

1 

6 

SSTOERC G043.1578+00.0314 

287.533691 

9.105292 

10.41 

2.09 

5.38 

2.46 

0.16 

0.15 

1 

7 

SSTOERC G043.1716+00.0017 

287.566772 

9.103776 

8.90 

0.63 

4.23 

0.96 

1.30 

0.22 

1 

8 

SSTOERC G043.1719-00.0114 

287.578735 

9.098016 

11.13 

2.14 

9.48 

6.26 

0.37 

0.49 

1 

9 

SSTOERC G043.1761 +00.0245 

287.548401 

9.118363 

14.14 

3.26 

20.10 

11.01 

0.48 

0.31 

1 

10 

SSTOERC G043.1837+00.0185 

287.557343 

9.122244 

12.52 

2.23 

10.01 

3.58 

0.16 

0.12 

1 

11 

SSTOERC G043.1860+00.0026 

287.572754 

9.117006 

13.37 

2.40 

16.46 

7.27 

0.52 

0.39 

1 

12 

SSTOERC G043.2096+00.0426 

287.547821 

9.156357 

8.09 

0.74 

3.27 

0.84 

2.53 

0.94 

1 

13 

SSTOERC G043.2099+00.0315 

287.557983 

9.151517 

14.60 

0.00 

19.20 

0.00 

1.30 

0.00 

2 

14 

SSTOERC G043.2942-00.1658 

287.774597 

9.135176 

12.03 

4.35 

13.96 

8.54 

0.93 

0.59 

2 

15 

SSTOERC G043.3542-00.0991 

287.742798 

9.219168 

9.63 

0.00 

5.50 

0.00 

3.0 

0.00 

3 

16 

SSTOERC G043.0885-00.0114 

287.539703 

9.024038 

17.30 

3.45 

34.03 

16.18 

0.17 

0.11 

1 


sid e edge of the mid- infrared double-ring structure proposed 
by iPeng et aTI (120101) . The double-ring structure was inter¬ 
preted as sites of massive star formation triggered by feedback 
from the central stars in W49A. Because of the large uncer¬ 
tainty in the Class II/I ratios due to small number statistics, 
the relative ages of subclusters according to the ratio of II/I 
will not be discussed. 

Although clusters 2 and 3 do not have significantly more 
members than clusters that appear in our random trials, we 
note that both of these clusters contain MYSO candidate ob¬ 
jects, which do not appear outside of our identified clusters. 
Cluster 2 contains one of the subclusters (2a) that are signif¬ 
icant and do not appear outside of the main clusters found at 
the larger break lengths. It is also associated with a region 
of enhanced extended IR emission that can be seen in Fig¬ 
ure IH Cluster 3 is adjacent to and may be associated with 
the supernova remnant W49B. The class II/I ratio of cluster 3 
is 3.50±2.81 and may indicate an apparent age intermediate 
b etween cluster 1 a nd clu ster 2. 

iMatthews et al.l (I2009^ identified dust clumps associated 
with W49 molecular clouds with continuum emission obser¬ 
vations at 850 fim with 15" resolution by using SCUBA/James 
Clerk Maxwell telescope. The dust clumps 98 and 100 seem 
associated with the subcluster lb (G43.10-0.04) with their 
given distances between 11.1 and 11.4 kpc determined from 
'^CO(l-O) and HI data bv iMatthews et al.l (120091) . The rest of 
the dust clumps, 83, 84, 85, 86, 87, 89, 90, 91, 92, 93, 94, 
95, 96, 97, 101, 102, 103, and 104, correspond to the region 
where we identified cluster 1, and they are shown in Figure [Tsl 
with blue crosses. The dust clumps 89, 90, 91, and 92, which 
are shown in Figure fl4l have the biggest masses between 3.5 
X 10"^ -2.8 xlO^ Mq, and it was noted by IMatthews et al.l 
(l2009h that assuming a dust temperature of 15 K, these mas¬ 
sive clumps will eventually form star clusters via fragmenta¬ 
tion. Dust clumps shown with orange crosses in Figure [TSl 
w ere determined to be at distances between 4.03 and 7.45 kpc 
bv iMatthews et al. I (120091) . which might indicate that the clus¬ 
ters 2 and 3 are not associated with the W49 complex. Follow¬ 
up observations will be necessary to verify the classification 
and distance of these YSO candidates and confirm their clus¬ 
ter membership. 

6.2. Line-of-siglit Cloud and W49 MST Clusters 


ISimon et all Jlo^ showed that the GRSMC 43.30-0.33 
cloud, which was identifie d with the Galactic Ring Survey 
(GRS: iJackson et al.ll200^ . is near the line of sight to W49 
at a distance of 3 kpc and largely overl aps W49. We used 
the integrated intensity CO maps from ISimon et alJ (120011) 
to compare the spatial distributions of YSOs in each cloud. 
We show the YSO clusters with the CO map contours in 
Figure [T5] Cluster 1 (G43.15-0.01) and the subclusters la, 
lb, Ic, Id, le, and If seem strongly associated with the 
strongest CO emission in the W49 GMC. Cluster 2 (G43.33- 
0.08) does not seem strongly associated with the CO emis¬ 
sion, and it is roug hly in line with a dust clump at a distance 
of 7.45-7.30 kpc (IMatthews et al.l 120091) . However, it con¬ 
tains an MYSO candidate (SSTOERC G043.3542-00.0991) 
and subcluster 2a (G43.31-0.08) which seems more strongly 
associated with the CO emission than the rest of the cluster. 

Similarly, on the line of sight of cluster 3 (G43.31-0.20), 
there is a dust clum p at a distance of 4.37-4.10 kpc 
(IMatthews et al.ll?009l) that might be associated with the fore¬ 
ground cloud GR SMC 43.30-0.33 The Class I YSO candi¬ 
date closest to the IMatthews et all (I2009h dust clump is offset 
by 8" from a group of infrared sources that are partially re¬ 
solved into four peaks of emission within a 4.5" aperture in 
the 3.6 and 4.5 p,m images but unresolved at 5.8 and 8.0 /rm. 
However, there is an MYSO candidate in that cluster (SSTO¬ 
ERC G043.2942-00.1658) that could be associated with the 
CO emission coming from the W49 cloud, as can be seen in 
Figure [12] and Figure [15] There are no clusters that seem to 
be associated with the brightest CO emission from GRSMC 
43.30-0.33 like we see in W49, and with it being closer to us, 
we would expect to be able to detect lower-mass and lower- 
luminosity objects from clusters associated with the fore¬ 
ground cloud. Because of its projected position on the sky, 
cluster 3 might be thought to have the greatest chance of be¬ 
ing a foreground object, potentially associated with GRSMC 
43.30-0.33 at 3 kpc, the dust clump at 4.37-4.10 kpc, or the 
W49B supernova remnant re gion at more than 10 kpc, as pre¬ 
viously mentioned in Section l3.2.2l 

6.3. Comparison with Other Star-forming Regions 

The 11.1 kpc distance to W49 limits our sensitivity to de¬ 
tect only the brightest and most luminous YSO candidates. To 
better understand W49 in the context of other regions form- 
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Figure 15. *^CO(1-0) integrated intensity contours iSimon et alJI200lll and YSO clusters overlayed on CO maps in gray scale. Left: *^CO(1-0) intensity is 
integrated over the velocity range associated with GRSMC 43.30-0.33 {u^sr = 35 — 50 km s“*) and contour levels are 2.5, 6.25, 10 K km s“*, shown in blue; 
right: *^CO(1-0) intensity is integrated over the velocity range associated with W49 GMC (ursr = -5 — 25 km s“*), and contour levels are 2.5, 14, 25.5, 37, 48.5 
and 60 K km s“' for W49 GM C, shown in black. Blu e crosses show the dust clumps determined at 11.1—11.4 kpc, and orange crosses show the dust clumps 
determined at 4.10-7.30 kpc bv IMatthews et alld2009f) . The red circle indicates the position of the supernova remnant W49B. 


ing rich clusters, we investigate how the YSO populations of 
luminous star-forming complexes G305 and G333 would ap¬ 
pear if they were at the greater distance of W49. Taken to¬ 
gether, these three regions also probe different evolutionary 
stages, from deeply embedded sources forming protoclusters 
to already-revealed open clus ters. __ 

G305 {l,b = 30.5.°4.-1-0.° 1: Iciark & Ported iTtiol is one of 
the most massive (M > 6 x lO^Mf^: iHindson et al.l 120101) 
star-forming complexes in the Galaxy, and is located in the 
Scutum-Crux arm at an estimated distance of ^4 kpc. The 
complex has a ringlike morphology of bright mid-infrared 
emission extending over ^34 pc that surrounds two optically 
visible open clusters, Danks 1 and Banks 2. The G305 com¬ 
plex has a considerable amount of ongoing star formation 
around the periphery of its oldest object, Danks 2 (3!!!i Myr), 
and the younger Danks 1 (l.Siliog) (iDavis et al.ll2012^ . 

G333 (also known as RCW 106; = 333.°3,-0.°4) lies 

at a distance of 3.6 kpc (iGarcfa et al.ll20l4) and extends over 
50 pc. It contains a number of H II regions and MYSO can¬ 
didates, with an estimated bolometric luminosity \og(Li,oi) = 
6.28, making it one of the most luminous mass ive star¬ 
forming regions in the Galaxy (lUrauhart et al.ll20T^ . 

Using the Spitz er mid-infrared photometry catalog from 
IWillis et al.1 (HSl, we identified 1189 and 1057 YSO can¬ 
didates in G305 and G333, respectiv ely, by using the classi¬ 
fication method described in Section l2~4l and performed the 
clustering analysis as described in Section lTTl We investigate 
the clustering properties of G305 and G333 both at their as¬ 
sumed distances and as they would appear if projected to the 
same distance as W49. 

These regions have a rich population of YSOs, and we iden¬ 
tified 15 YSO clusters with this MST method in the G305 
complex using the straight-line fit method determining dc = 


Table 11 

Clusters/Subclusters in G305 


Parameter 

Straight-line 

Fit 

Ngrp 

Maxi 

Number of clusters 
Cutoff distance 
Percent in clusters 
Group size 

Class II/I ratio 

15 

48" (0.9 pc) 

76 

36"-911" (1-18 pc) 
3.5(0.3)* 

32 

30" (0.6 pc) 

44 

9"-295" (0.2-6 pc) 
3.0(0.3)* 

Number in parentheses indicates Poisson uncertainty in ratio. 

Table 12 

Clusters/Subclusters in G333 

Parameter 

Straight-line 

Fit 

Ngrp 

Maxi 

Number of clusters 
Cutoff distance 
Percent in clusters 
Group size 

Class II/I ratio 

11 

73" (1.3 pc) 

81 

110"-1353" (2-24 pc) 
4.5(0.4)* 

45 

52" (0.9 pc) 

54 

43"-306" (1-5 pc) 
4.0(0.4)* 


Number in parentheses indicates Poisson uncertainty in ratio. 


48" and we set > 7 members. With the same method, we 
found 11 YSO clusters with dc = 73" in G333. In order to see 
the hierarchical structure, we also used a short cutoff distance 
that maximizes the number of clusters, identifying the clus¬ 
ters shown in Figure [161 The clustering results for these two 
regions at their original distances are shown in Table [TT] and 
in Table [T 2 I 

For a large branch length cutoff, the probability of the ob- 
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Figure 16. Left: subclusters identified in the G305 star-forming region (using dc = 30") are overlaid on the IRAC 5.8 f^m image. Right: subclusters identified in 
the G333 star-forming region (using dc = 52”) are overlaid on the IRAC 5.8 /rm image. Distributed YSO candidates not assigned to groups are plotted with gray 
points. 


served clustering properties occurring randomly is relatively 
high. In G305 the most significant result is the size of the two 
largest clusters, which occured in less than 0.1% of our simu¬ 
lations. In G333 the largest cluster is smaller (378 members) 
and therefore has a higher probability of occurring randomly 
(7%). 

The results for both clusters are more significant with a 
shorter cutoff distance. The probability of the G333 sub¬ 
clusters occurring randomly is 3.3%-18.3%, indicating that a 
larger number of these structures may be randomly connected 
sets of objects. This is supported by the morphology of the 
G333 subclusters, which display a significant number of sub¬ 
clusters with very linear or filamentary appearance that occur 
randomly. The probability of the number and sizes of G305 
subclusters occurring randomly is <0.1%, and the clusters 
also have a strong association with features in the extended 
IRAC 5.8 /im image, including subclusters associated with 
both the open clusters Banks 1 and Banks 2. 

We determined the projected appearance of the G305 and 
G333 YSOs by rescaling the relative source positions and 
magnitudes for a distance shift from 3.6 kpc (G333) and 4 kpc 
(G305) to the 11.1 kpc distance to W49. We applied a limit¬ 
ing magnitude of 20.3, 19.3, 18.2, 18.6, 16.3, 14.8, 14.3, and 
9.4 mag for J, H, K, 3.6, 4.5, 5.8, 8.0, and 24 pm, respec¬ 
tively, which are the faintest detections in the W49 field with 
error smaller than 0.2 mag. After reprojecting the source co¬ 
ordinates, we did not find any overlapping sources within the 
same photometric aperture (2") at the rescaled distance. 

After re scali ng, we reclassified all the sources as described 
in Section 123] We do not assume any additional reddening, 
so the colors of the objects do not change after projecting. 
However, sources may be below the applied limiting magni¬ 
tude from the W49 catalog in one band and above in others, 
resulting in a change in the source classification based on the 
new, more limited wavelength coverage. 

The new projected catalog for G305 contains 696 YSOs, 
only 59% of YSOs from the original catalog, and the pro¬ 
jected catalog for G333 contains 404 YSOs, only 38% of the 


original YSOs. In Section 1231 we applied a cut for sources 
with [3.6] > 13 to remove likely background objects from the 
YSO sample. When we apply the same cut to the projected 
G305 YSOs, the resulting population is 6.9% of the original 
size, and for G333 the resulting YSO population after apply¬ 
ing the faint-object cut is 7.9%. 

The field of view for G305 and G333 was approximately 
the same at their original distances, 0.5° x 0.5°. After re¬ 
projecting G305, the area decreases to 0.18° x 0.18° and the 
G333 area decreases to 0.16° x 0.16°. We applied the MST 
clustering analysis to the G333 and G305 YSO populations at 
both their original and projected distances, comparing to sim¬ 
ulated random clusters for each set of YSO population and 
observed or project ed area parameters. 

In Section 13.2.11 the straight-line fit method for W49 ap¬ 
peared to identify GMC-scale clusters. For the rescaled clus¬ 
ters without cutting off sources that fall below the [3.6] = 
13 mag cut, we used only the straight-line fit method. For 
the rescaled G305 we derived the cutoff distance as 22" (cor¬ 
responds Xo dc= 1 .2 pc, at a distance of 11.1 kpc) and for the 
rescaled G333 we derived the cutoff distance as 33" (corre¬ 
sponds to dc = 1.8 pc, at a distance of 11.1 kpc). For G305 
and G333 the reprojected straight-line fit cutoff distance finds 
only one cluster per region, with a clustered fraction of 74% 
in rescaled G305 and 81% in G333. 

Finally, we examined the clusters in the rescaled G305 and 
G333 as they would appear if we removed the sources fainter 
than [3.6] = 13. The straight line-fit cutoff distance without 
faint sources was 52" for G305 and 78" for G333. Although 
these are shorter cutoff distances than W49, the clusters found 
are not highly significant. The largest cluster in rescaled G333 
has only M members, only half the size of cluster 1 in W49. 
The chance of the largest rescaled G333 cluster occurring ran¬ 
domly was 3.2%, compared to < 0.1% for cluster 1 in W49. 
The G305 rescaled clusters were smaller, but because of their 
shorter cutoff distance, they were also more significant. The 
largest cluster in rescaled G305 had only 32 members, a size 
that was met in only 0.2% of our simulated rescaled G305 
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clusters. 

To directly compare the sizes and numbers of the clusters 
in W49 to G305 and G333, we also determined the clustering 
properties in G305 and G333 using the W49 parameters from 
the straight-line fit {dc = 96" with N >1) and the peak that 
maximized the number of groups {dc = 40" with N > 6). We 
found only one cluster with dc = 96", and although the clus¬ 
ters are larger than for the best-fit cutoff distance for G305 
and G333 individually, they are still smaller than the largest 
cluster in W49. This indicates that W49 has a larger popula¬ 
tion of high-luminosity YSOs than either G333 or G305. With 
dc = 40" we found three clusters in G333 and five clusters in 
G305, with similar size distributions to W49 clusters la-lf 
and 2a, as can be seen in Figure [17] There were three clus¬ 
ters in only 0.3% of the rescaled G333 simulations, and there 
were five clusters in only 0.2% of the rescaled G305 simula¬ 
tions. The number and size distributions of the subclusters in 
G305, G333, and W49 are all significant, although W49 again 
has the richest population, with seven subclusters found. In 
W49, six of the seven clusters are found within an area of ap¬ 
proximately 0.15° X 0.15°, comparable to the projected area 
of G305 and G333 and indicating that the larger number of 
tightly packed clusters in W49 is not simply due to the larger 
field. The W49, G305, and G333 clustering properties and 
associated probabilities are all summarized in Table [TI] 

6.4. Class //// Ratio as a Cluster Age Indicator 

As seen in Table [l4] the Class II/I ratios in G305 and G333 
change when the YSOs are reprojected to W49 distance, and 
because of its higher Class II/I ratio, W49 appears oldest and 
the G305 region appears to be the youngest. However, as we 
mentioned in Section 1631 G305 has an older population with 
its two open clusters. This drop in Class II/I ratio in G305 and 
G333 regions after rescaling to the greater distance is mainly 
because lower-luminosity Class II YSOs drop below our de¬ 
tection limits before Class I YSOs do, biasing our observa¬ 
tions of distant regions toward younger objects. This effect 
must be taken into account when comparing clouds at differ¬ 
ent distances using the Class II/I ratio as an age indicator. 

7. SUMMARY 

In this paper, we combined Spitzer IRAC (3-8 /rm), MIPS 
(24 pm), and UKIDSS near-IR {JHKs) data for the W49 
GMC to identify and classify YSOs by using their infrared 
colors and magnitudes and analyzed their clustering proper¬ 
ties according to their spatial distribution across the region. 
We found the following numbers of YSO candidates: 186 
Class I, 907 Class II, 74 transition disks, and 46 deeply em¬ 
bedded protostellar sources. We used the MST method to 
identify the groups and subclusters in the region and found 
that 52% of YSOs (including transition disk objects) belong 
to clusters of >7 members in the W49 GMC. In order to assess 
the significance of the identified MST clusters, we performed 
simulations on randomly distributed YSOs and examined the 
probability of finding random clusters. We found that clus¬ 
ter 1 represents a large-scale structure in the cloud extending 
^27 pc in diameter, with a very low probability of occurring 
in a random distribution of sources. This cluster is centered on 
the previously identified main W49 region with many tracers 
of high-mass star formation detected. We also used a smaller 
cutoff distance to investigate the hierarchical structure in the 
cloud, finding several subclusters within the larger clusters. 
The distribution of identified subclusters has a very low prob¬ 
ability of occurring randomly. 


We applied an SED fitting method to 231 YSO candidates 
in the MST clusters in order to identify the most massive can¬ 
didates in our sample and found 16 YSO candidates that have 
masses >8 Mq. We found that the massive Class I candidate 
SSTOERC G043.1518 h- 00.0115 is located at the position of 
the UCHII region W49 S. However, we could not find any 
more associations because most of the H II regions are un¬ 
classified in our catalog owing to lack of photometry. We also 
examined previously identified methanol masers in the region 
that are known as good indicators of high-mass star formation. 
However, we do not find any IR-identified MYSO candidates 
that appear physically associated with those masers. 

We compared the W49 region to two other star-forming re¬ 
gions, G305 and G333. In our cluster analysis we used the 
cutoff distances and the ratio of class II/I sources in each cloud 
in order to derive information about their relative ages and 
YSO densities. In order to directly compare them, we de¬ 
termined the projected appearance of the G305 and G333 re¬ 
gions and compared the number and size distributions of the 
subclusters. We saw that there are more clusters in the W49 
region, which indicates that W49 has the richest population 
of high-luminosity YSOs. However, we could not use the ra¬ 
tio of Class II/I objects as an age indicator of clusters, since 
at greater distances we lose a significant number of older ob¬ 
jects, which biases our observations. This comparison also 
indicates that there is potentially a large number of YSOs in 
W49 that we have not detected. 

This work is based in part on observations made with the 
Spitzer Space Telescope, which is operated by the Jet Propul¬ 
sion Laboratory, California Institute of Technology, under a 
contract with NASA. Support for this work was provided by 
NASA. This publication makes use of data products from 
the Two Micron All Sky Survey, which is a joint project of 
the University of Massachusetts and the Infrared Process¬ 
ing and Analysis Center/California Institute of Technology, 
funded by the National Aeronautics and Space Administra¬ 
tion and the National Science Eoundation. The authors grate¬ 
fully acknowledge the referee for useful comments and addi¬ 
tional discussion. G. S. acknowledges partial support from 
NASA Grant NNX12AI60G and Istanbul University grant 
BAP50195 since this work is part of her PhD thesis research. 
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Table 13 

W49, G305, and G333 Clustering Comparison 


Region^ 

Cutoff Distance 

First Cluster Size'’ 

Second Cluster Size 

Third Cluster Size 

Groups > 

W49 

96" 

97 

15 

9 

3 

Straight-Line Fit 


<0.1% 

66 .6% 

99.3% 

99.9% 

W49 

40" 

9 

8 

7 

7 

Peak 


0 .2% 

<0.1% 

<0.1% 

<0.1% 

G333 

73" 

378 

309 

39 

11 

Straight-line fit 


7% 

<0.1% 

99% 

99.9% 

G333 

52" 

40 

39 

37 

45 

Peak 


3.3% 

<0.1% 

<0.1% 

< 0.1% 

G305 

48" 

418 

328 

26 

15 

Straight-line fit 


<0.1% 

<0.1% 

99.9% 

99.9% 

G305 

30" 

113 

78 

49 

32 

Peak 


<0.1% 

<0.1% 

< 0.1% 

< 0.1% 

G333 rescaled 

78" 

51 

6 


2 

Straight-line fit 


3.2% 

99% 


99.9% 

G333 rescaled 

52" 

24 

18 


2 

Peak 


<0.1% 

<0.1% 


65% 

G305 rescaled 

52" 

32 

10 

10 

4 

Straight-line fit 


0 .2% 

29.3% 

4.9% 

19.3% 

G305 rescaled 

40" 

19 

10 

10 

5 

Peak 


0 .1% 

0 .2% 

<0.1% 

0 .2% 

G333 rescaled 

96" 

57 



1 

W49 straight-line dist. 


55.8% 



99.9% 

G333 rescaled 

40" 

13 

8 

7 

3 

W49 peak dist. 


0 .2% 

<0.1% 

0 .1% 

0.3% 

G305 rescaled 

96" 

74 



1 

W49 straight-line dist. 


85% 



99.9% 

G305 rescaled 

40" 

19 

10 

10 

5 

W49 peak dist. 


0 .1% 

0 .2% 

< 0.1% 

0 .2% 


^ Rescaled G305 and G333 do not contain sources below the [3.6] = 13 cut 
Percentage is showing the probabilty of the clusters to occur randomly. 

^ N,„i„=6 for peak and N„,i„=l for straight-line fit 
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Figure 17. Left: subclusters identified in the rescaled G305 star-forming region using the W49 cutoff distance dc = 40". The YSOs are plotted in colors 
according to the clusters identified. The distributed YSO candidates not assigned to groups are plotted with gray points. Right: same as the left panel, but for 
G333. 
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Table 14 

Clustering Analysis Summary 


Region^ 

Cutoff 

Distance(") 

Cutolf 

Distance (pc) 

Class II/I 
Ratio’’ 

W49 

96 

5.2 

1.8(0.4) 

G305 

48 

0.9 

3.5(0.3) 

Rescaled G305 

96 

5.2 

0.5(0.1) 

G333 

73 

1.3 

4.5(0.4) 

Rescaled G333 

96 

5.2 

1.0(0.3) 


^ W49 parameters from straight-line fit = 96” have been used here 
to direetly compare the biggest clusters in the resealed regions without 
sources fainter than [3.6] = 13 

^ Number in parentheses indicates Poisson uncertainty in ratio. 
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